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ABSTRACT : 

In this paper several infinite two-person games are studied, all 

having the following common structure: Player 1 (Emitter) produces a 

binary periodic sequence. Player 2 (Predictor) observes some initial 
segment of this sequence and then tries to predict the next digit. The 
payoff to Emitter is zero if the prediction is a correct one. The games 

differ in additional assumptions — those are in particular: (1) Pre- 

dictor required to make his prediction after observing a prescribed 
number of digits of the sequence. (2) Predictor allowed to observe 
any number of digits but earning a decreasing amount for each correct 
prediction as the number increases. (3) The period of the emitted 
sequence being chosen by random from some fixed distribution. (4) 
Emitter allowed to choose the period but being paid a decreasing 
amount for incorrect prediction as the period increases. Combining 
these assumptions two zero-sum and two nonzero-sum games are obtained. 

It is shown that all these games possess a solution, some are at least 
partially solved and their further properties investigated. 
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1. INTRODUCTION. 



In this paper we study certain types of infinite games which 
belong to the class of discrete emission-prediction games, also 
known as discrete games of aiming and evasion. An emission-pre- 
diction game is a two-person game, where the first player, to be 
named Emitter, produces a sequence of elements from some fixed 
finite set A. The second player, to be named Predictor, is allowed 
to observe the sequence for some time and is then required to make 
a prediction of some kind about the future behavior of the sequence. 
The general Predictor’s goal is to make a correct prediction, while 
Emitter wants to avoid this. 

Emission-prediction games may serve as models for a variety 
of conflict situations. For instance, Emitter could be identified 
as an attacker (bomber, submarine, guided missile, guerilla 
unit) performing a series of evasive maneuvers, which, in the dis- 
crete time, represent the sequence being emitted. Predictor is 
trying to destroy the attacker. Since, in general, the attacker 
can be intercepted only after one or more time units have elapsed 
from the last observed maneuver (or position) , Predictor must be 
able to correctly predict future maneuvers (positions) of the 
attacker . 

To our best knowledge the study of emission-prediction games 
was initiated by Isaacs (the "bomber and battleship" problem 
and have since been investigated by Dubins [2], Isaacs and Karlin [5], 
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Isaacs [4], Karlin [6], Blackwell [1], Ferguson [3] and Matula [7]. 
The assumptions made by these authors can be summarized as follows: 

(1) No restriction is placed upon the emitted sequence. 

(2) Predictor is allowed to observe the sequence as long as he 
wishes before he decides to make a prediction concerning 
several subsequent terms of the sequence. 

(3) The game is a zero-sum game with the payoff depending only 
on the discrepancy between the prediction and the actual 
values . 

The emission-prediction games investigated in this paper 
differ mainly in one crucial aspect; we assume that the emitted 
sequence is a periodic one. Apart from this being of interest per 
se we have been motivated by the following idea. Suppose that the 
attacker is a simple automatic device with built-in preprogrammed 
ability to perform evasive maneuvers. The trajectory of this 
device will then follow a periodic pattern with some period depend- 
ing upon the complexity of the program and unknown to Predictor — 
the defender. Even in situations where the attacker is controlled 
by a human operator periodicity may very well serve as a first 
approximation. We believe that most humans do exhibit a kind of 
cyclic pattern when asked to perform a series of evasive, i.e. 
unpredictable maneuvers. 

Assuming periodicity of the emitted sequence, however, 
compels us to place some additional restrictions on Emitter and/or 
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Predictor since otherwise the game would not possess a solution. 

The additional restrictions considered in this paper are: the 

period is chosen by an independent chance mechanism, bounds on the 
length of observation intervals, and penalties for long periods and 
lengthy observation. As for the type of prediction required we 
limited outselves to the simplest case of predicting the next term 
of the sequence. (With the periodicity assumptions this is by no 
means trivial as it would be without it.) For the sake of simpli- 
city of notation we consider binary sequences only; most of the 
results of this paper extend easily to the m-ary case. 

Another possible application of the model may be that of 
an optimum jamming strategy for a missile defense system. Suppose 
that a ship is being attacked by a missile equipped with a target 
search radar. To avoid detection the search radar is not active 
all the time but rather is constantly being switched on and off in 
some programmed pattern. The intensity of illumination of the tar- 
get by the search radar then follows a binary periodic sequence 
(in a discrete time). The defender of the target (ship) wants to 
jam the search radar. The jamming device, however, must not be 
transmitting all the time since the missile could then home on the 
transmitter and hence the target. Thus, the defender faces the 
problem of predicting the next mode of operation of the search 
radar after observing the illumination pattern for some limited 
time, i.e., predicting next term of the binary sequence. 
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2. PRELIMINARIES. 

Throughout this paper the symbol A will denote the two- 
element set A = {0,1} and the symbol ft with generic elements 
m will denote the set of all periodic sequences of zeros and ones. 
We say that a binary sequence 



a) = (m- , . . . ) ; a) € A; n 

1 / n 

is periodic with the period tt (ui) = t; t 

a) M =o) for every n = 1,2,.. 
n+kt n J 99 

Obviously, each periodic sequence cj € ft 
we denote 



= 12 

x , z. , . . . , 

= 1,2, ... , if 

) k 1,2,... . 

has only one period, and 



ft = € ft : tt(cjd) = t} 

so that ft is a disjoint union of ft^jft^j*** • 

If a) £ ft and n = 1,2,... then m will denote the 

n 

n-th term and mj n the ordered n-tuple of the first n terms 
of the sequence m. 

Let A n be the set of all ordered n-tuples of zeros and 

ones. If a = (a- , . . . ,a ) € A n and 6 6 A then a6 is the 
1 n 

oo 

ordered (n+1) -tuple (a^ , . . . , a n > 6) and (a) is the periodic 
sequence 

oo 

a) = (a- , . . . ,a ,a- , . . . ,a ,...). Clearly, 7r((a) ) ^ n. 
Inin 
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k n 



For every a 6 A and t = 1,2,,.. we define the function 



I t (a) = 



1 if there is oo 6 such that go = (a) , 



’ 0 otherwise. 



(i) 



and the function 



X (a) = min{ t = 1,2,... : I (a) = 1}. (2) 

Thus (a) serves as an indicator of "potential periods" of a 
and A (a) is the "smallest potential period" of a. 

Later we will need the following lemma, 



Lemma 
A (aa) 
number 




x. 



For every a 6 A n there exists a £ A such that 
+ 1, where [x] denotes the integral part of the 



Proof : For any n-tuple a = (a^,...,^) denote 

T(a) = {t = l,...,n-l : I^Ca) = 1} if n > 1 
and T(a) = 0 if n = 1. Notice first that 

T(aO) fl T (al) = 0 

and (3) 

T(aO) U T (al) = T(a) U {n}. 

This follows by realizing that if t 6 T(a) then a = (a^, . . . ,a ; 
ot^, . . . >ct^, • . . . . ,ot.) , 1 2 ^ l ^ t so tha t t ^ T ( acx | ^ ) 
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if i < t and t € T(aa^) if i = t. Conversely, if t £ T ( aa i+i^ 
and t < n then t £ T(a) and clearly t £ T(aa) for a / a ±+i* 
Finally, n £ T(aa) if and only if a / . 

Let now a = (a- , . . . ,a ) and assume n > 2 since the 
1 n 

lemma is trivial for n = 1,2. If T(a) = 0 then by (3) T(aO) 

U T(al) = {n} so that A (aa) ^ n for any a £ A. 

If T(a) 4 0 then there is b = and 

c = such that 

a = (be be ... be b) and T(bc) = 0. 
m times 

If T(b) = 0 then T(a) = (k + £} and by (3) A(aa) ^ n for some 
a 6 A. 

If T(b) ^ 0 then there is e = (e^,...,e^) and d = (6^,...,6 g ) 
such that 



b = (ed ed ... ed e) and T(ed) = 0. 



Hence taking a ^ = g^ we 



have 



A(aa) ^ m(k+£) + r + s^k+£ + 2 = n- k+ 2 



and since 2k ^ n - 1 we obtain 



A (act) £ ^ + 2 > 



+ 1. The lemma is proved. 
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3. ZERO-SUM GAMES WITH RANDOM PERIOD. 

In this section we consider the case where 

(1) the period t of Emitter’s sequence is chosen randomly 
according to some distribution v known to both players, 

(2) Predictor is allowed to observe the first n digits of 
the Emitter’s sequence and tries to predict the next 
(n+l)st digit, 

(3) the payoff to Emitter is zero if the prediction is correct 
and one otherwise. 

In other words, the expected payoff is the probability of 
wrong prediction, which the Emitter is trying to maximize and the 
Predictor to minimize. 

We will denote the resulting zero-sum game by G(v,n), 
where v(t) , t = 1,2,... is the period distribution, 

00 

v(t) £ 0, t = 1,2,... ; l v(t) = 1 

t=l 

and n = 1,2,... is the length of the sequence observed by the 
Predictor . 

Next we describe Emitter’s and Predictor’s pure strategies. 
To specify a pure strategy Emitter must choose for every 
t = 1,2,... a binary sequence with the period t. Emitter’s pure 
strategies s are therefore sequences 

s = {s t , t = 1,2,...}, where s^_ € 
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The set S of all Emitter f s pure strategies is then an infinite 
Cartesian product S = ^ x ^ x of finite sets and is 

therefore uncountable. 

Predictor f s pure strategies <j> are simpler to describe. 

Each <|> is a mapping of A n into A = {0,1}. For each a G A n , 

(a) is the prediction of the (n+l)st term of the Emitter f s 

sequence if the first n terms observed are the n-tuple a. The 

set $ of all Predictor ! s pure strategies is therefore a finite 
2 n 

set of 2 elements. 

The payoff function W(s,<f>) is then given by 

00 

W(s , <)>) = l v(t) W (s , <f>) ; s € S, <)>€*, 
t=l 

where for t = 1,2,... and u) = s , 

’ ’ t 

{ 1 if d>(w| n ) i w n+1 , 

0 if <Kw| n ) = “ n+1 - 

It is easy to see that W t (s,<j)) is nothing but conditional proba- 
bility of wrong pi diction given that the period is t and hence 
W(s,<j)) is the unconditional error probability. 

The game G(v,n) is now formally defined by the triplet 
(S,$,W). This is an infinite zero- sum game, however, since the set 
$ is finite and the layoff function is nonnegative and bounded by 
one we have by the well-known theorem of Wald [8] the following 



proposition . 
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Proposition 3.1 : The game G(v,n) has a value and Predictor has 

an optimal strategy. 

Let us turn our attention to the existence of Emitter's 
optimal strategy. By another Wald ! s theorem ([8], Theorem 2.20) 
an optimal strategy exists if the corresponding space of pure strate- 
gies is totally bounded with respect to the intrinsic metric 
([8], sec. 2.1). Now the space S of Emitter's pure strategies 
is totally bounded in the above sense if for every e > 0 there 

exists a finite subset S of S such that for every s € S 

e 

there is an r £ S for which 

e 

max|w(s,<j)) - W(r,<j>)| < e. 

It is easy to see that this condition is satisfied for the game 

G(v,n); take an integer T such that £ v(t) < e and define 

t>T 

S to be the set of all pure strategies s = {s , t = 1,2,...} 
e t 

with s fixed for t > T. Thus we have proved the following 
proposition. 

Proposition 3.2 : In the game G(v,n) Emitter has also an optimal 

strategy . 

In general, optimal strategies will be mixed strategies, 
that is probability distributions on the corresponding spaces of 
pure strategies. 

Let X be the space of Emitter 's mixed strategies. Since 
S is a product space a mixed strategy x £ X will be a product 



probability measure 
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X = x x 2 X ... 9 

where x^ is a probability distribution on the finite set 
In other words, x € X is completely specified by a sequence of 
probability distributions x^_; t = 1,2,... . The payoff function 
W then extends onto X x $ by 



W(x,<f>) 



W(s , <|>)dx(s) 

j 

S 



I v(t)W t (s,<())dx(s) 



= I v(t) 



t=l 



W (s,cf))dx(s) 
) t 

S 



I V(t) l W (s , <|) )x (s ) . 

t=i s t €n t 



Predictor's mixed strategies y £ Y are again probability 
distributions on the finite set Thus for every such y and 

s £ S we can define 



00 

W(s,y) = £ W(s,<J>)y(<t>) = £ v (t) I w t (s,<J>)y(<|>) . 

t=l <|>€4> 

Let y € Y and let for every a 6 A n , a 6(0,1} f (a) = £ y(<f>) 

a {<(> :<j> (a)=ot) 

Clearly, f^(a) as t ^ ie Probability of predicting a if the 
n-tuple a has been observed and if Predictor uses the mixed 
strategy y. Thus f = (f^,f^) could be called Predictor f s 
behavioral strategy and we have for a) = s^_ 
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r f o^i n > 


if 


“n+l 


= 1, 




W t (s,f) 


= l W (s *)y(*) = 










(4) 








if 


“n+1 


= 0 . 




Conversely, if 


f = (fg.fp where 


f maps 


A n 


into 


[0,1] 


and 


f 0 (a) + f^a) = 


1 for all a € A n 


then 










y(<f>) = 


I f 0 ( a ) + 


I f x ( a ) 


> 


* € 







{a£A n :<t>(a)=0} {a€A n :<J> (a)=l) 

is a mixed strategy and (4) holds. Hence we can work with the set 
F of Predictor’s behavioral strategies instead of with the set Y. 

Proposition 3.3 : The value v of G(v,n) is given by 

00 

v = min £ v(t) max max{I (aO)f (a) ,1 (al) f_(a) } (5) 

f€F t=l a€A n t i t u 

& 

and f is Predictor’s optimal strategy if and only if it minimizes 
the series above. 



Proof : 



Now 



Since G(v,n) has a value we must have v = min sup W(s,f). 

f (EF s6S 



sup W(s ,f ) 
sGS 



l v(t) max W (s,f) , 
t=l s t ©l t 



and by (4) 



max 

s €« 



t 



W t (s,f) 



= max{f^_^(a) } , 
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where the latter maximum is taken over the set of all (n+1) -tuples 
aa £ A n+ ^ for which U)j = aa for some a> 6 i.e. for 

which I^(aa) = 1. Hence 

max W (s,f) = max max{I (aO)f_(a),I (al)f^(a)} 
s t €tl t C a€A n t; 1 t 0 

and the second assertion follows from the existence of Predictor f s 

optimal strategy. 

Setting f (a) = for each a 6 A n in (5) we obtain 



Corollary : 



0 ^ v 



1 

* 2 



Proposition 3.4 : There exists Predictor 1 s optimal strategy f 

I'l ^ ^ 

such that for each a £ A either f (a) = (1,0) or f (a) = (0,1) 

* 11 
or f (a) = (~~), 

Proof : Let for every t = l,...,n+2 and a 6 A n 

B(t,a) = { f^ : f (a) = max^ max{ I (aO) f (a) , I (al) f Q (a) } } , 

a£A n 

where f^ denotes the vector with 2 n components f^(a). Clearly, 

B(t,a) is a convex polyhedron contained in the 2 n -dimensional 

unit hypercube. Furthermore, the vertices of B(t,a) are vectors 

with components 0 or 1 or 1/2 only. 

Let u = (a.. , . . . ,a . n ) , a. 6 A n and let 
1 n+z l 

n+1 

L u ( V ■ i v(t)f 0 (a t ) + f 0^ a n+2 ) l v(t) - 

t=l t>n+l 



13 



Now L is a linear function of f_ and hence its minimum over 
u ~"0 

the convex polyhedron 



n+2 

B u = 0 B(t,a t ) 

is attained at one of the vertices of B^. Denoting _f^ the 
vertex at which the minimum is attained we see that for each a £ A n 

f^(a) =0 or 1/2 or 1. (6) 

Finally, since I^Cacx) = 1 for t > n + 1, 



min £ v(t) max max{I (aO) f - (a) , I (al) f n (a) } 
f t=l a€A n t 1 t U 



= mm 
u0V 



n(n+2) 



min L (f^) = min 



^ €B u 



u -H) 



u€A 



n(n+2) 



L (f?) 

u —0 



= L 



(f 



), 



where u* minimizes 



L u<# 



u* u* u* 

It follows from Proposition 3 that f = (f^ , 1-f^ ) an 

optimal strategy which together with (6) terminates the proof. 

Proposition 3.5 : Let £ v(t) > -j. Then v = y and 

1 1 t>n+l ^ 

f(a) = (— ,— ) , a 6 A is the unique Predictor ! s optimal strategy. 
* 

Proof : Let f be an optimal strategy and assume that for some 

a 6 A n , a € {0,1} f*(a) + 



It follows from the proof of 
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Proposition 3.4 that there must be another optimal strategy with 

k -k 

f (a) =0 or 1. Hence assume that f (a) = 1 (otherwise take 
a a 

f (a) ) . Then 
l-oi 



l v(t) max^ max{ I (aO) f 1 (a) ,1 (al)f_(a) } 
t=l a£A 



k k 

^ max max{f (a),f n (a)} £ v(t) > — 

a€A n 1 U t>n+l 



1 * 1 
so that v > y > which is a contradiction. Hence f^(a) = *2 ^ or 

all a € A n , a <E {0,1}. For this f 



oo 

v = £ v(t) y max max{l (aO) ,1 (al) } = 4 

t=l a€A n 



since for every t = 1,2,... there exists a G A n and a £ {0,1} 



such that I^(a ) = 1. 



Proposition 3.6: Let the distribution v be such that 



v([y] + 1) > l v(t). 



t> [f]+l 

where [y] is the integral part of Then 



(7) 



t>[f]+l 



V 



v(t) 



( 8 ) 
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and the Predictor s behavioral strategy f defined by 



X(aa) £ [-|] + 1 =» f*(a) = 0 



(9) 



is an optimal strategy. 



Proof : Let us assume first that n is odd so that [-“] + 1 = 

Let e £ A n be the n-tuple which has 1 at the th place 



and zeros elsewhere. Clearly 



n+1 



I^CeO) =1 if an only if t > — — , 



and 



I^Cel) =1 if and only if either t = or t > n + 1. 



Hence 



l v (t) max n max{I t (aO)f 1 (a) ,I t (al)f 0 (a)} 
t=l a£A 



k l v(t) max{ I (eO) f .. (e) ,I (el)f f .(e)} 
t=l c x c u 



= f Q (e) + f- L (e) £ v(t) + max{f ;L (e) ,f Q (e) ) l v(t) 



n+1 



< t^n+1 



t>n+l 



^ £ v(t) by the hypothesis. Thus v ^ £ v(t) 



t>- 



n+1 



n+1 



Next let f be the strategy (9). Then by Lemma 2.1 
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vi l v(t) max max{l (aO)f!'(a),I (al)f*(a)} 
t=l a€A 1 C 



Hence v 



y v(t) max 
L . , ^ n 



n+1 



a£A 



t>- 



I v(t) 
n+1 



t>- 



1 v(t) and 
n+1 



max{l t (aO)f^(a) ,I t (al) fg(a) } 



& 

f is an optimal strategy. 



If n is even we begin with e having 1 at the (—+l)st place. 
The rest follows verbatim. 



If the hypothesis of the above proposition is not satisfied 
the game G(v,n) is very difficult to solve. As seen from Propo- 
sition 3.3, this amounts to solving the nonlinear programming 
problem: 



"minimize £ v(t) max max{I (aO)f (a) ,1 (al)f n (a)} (10) 

t=l a€A n t 1 t U 

subject to f € F." 

It is true that it can be solved by breaking it into a number of linear 
programming problems as we did in the proof of Proposition 3.4 but 
the size of this task is still formidable. A slight simplification, 
however, can be obtained by considering truncated versions of 
G(v ,n) . 
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T 

Let T = 1,2,..., let G^(v,n) be the game (S,$,W ) 
obtained from G(v,n) by truncating the distribution v at T, 
that is setting v(t) = 0 for t > T and normalizing to one. 

T T T 

Proposition 3.7 : Let v , f and x be the value and optimal 

strategies respectively of the truncated game G T (v,n). Then, as 
T °°, 

X x k t * 

v v, f -*■ f and x -*■ x , 

k k 

where v, f and x are the value and optimal strategies of 
G(v,n) . 

Proof : Since v is a probability distribution and 

|w T (x,f) - W(x,f ) | £ l v(t) 

t>T 

T 

W (x,f) converges to W(x,f) uniformly in both f and x, which 
implies the statement. 

To the end of this section we present a solution of a small game 
G(v,n) . 

Example : Let n = 5, v(t) > 0 for t = 1,...,5 and v(t) = 0 

for t > 5. To compute the right-hand side of (5) we need the 
values of I^Caa) for t=l,...,5, a € , a =0,1. These 

are given in the first two columns of Table 1. Substituting into 
(10) we obtain 
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v(l)f 1 (a 1 ) + v(2)f 0 (a i;L ) + v(3)max{f Q (a 5 ) , f 1 (a 1Q ) , f^a^) } 



+ v(4)max{f 1 (a 3 > ,f 1 (a 5 ) > f 0 ^ a 9 ^ }f 0 ^ a 13^ ’ f 0 ^ a l5^ * 



+ v(5)max{f^(a^) ; i = 2,..., 16} 



+ v (1) f q ( a 3 2> + + v ( 3 ) max ^ f 1 ( a 28^ ,f 0^ a 23^ ,f l^ a l9' > ^ 



+ v(4)max{f 0 (a 30 ),f 0 (a 28 ),f 0 (a 26 ),f 1 (a 24 ),f 1 (a 20 ),f 1 (a 18 )} 



4* v (5)max{fQ(a i > : i = 17, .•.,31}, 



( 11 ) 



Looking at the expression (11) as a function of 64 nonnegative 

variables f,(a.) constrained to satisfy f^ (a . ) + f- (a . ) = 1 we 
2 i 0 x 1 l 

see that the only variables that make the minimization difficult 
are those for which both an ^ appear in (11). 

Hence it is easy to conclude that the minimum of (11) , the value 
v of the game, is equal to the minimum of 



v(2)u^ 4* v(3)u^ 4* v(4)max{u^,l-U2} + v (4)max{l-u^ jl-^jl-u^} (12) 

over all u^,u^,u^ satisfying 0 ^ u £ 1, j = 1,2,3. 

The minimum of (12) is found to be the smallest of the four numbers: 
v(4) 4* v (5) , v(3) 4* v (5) , v(2) 4* v(3) 4- v(4), ^(v (2)4*v (3)4*v (4)4*v (5) ) . 

Hence, the value v of the game is: 



Case 1 : v(4) ^ v(3) and v(4) 4- v(5) ^ v(2) 4- v(3) 



v = v(4) 4- v(5) 
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Case 2 : v(3) £ v(4) and v(3) + v(5) £. v(2) + v(4) 

v = v(3) + v(5) 

Case 3 : v(2) + v(3) + v(4) ^ v(5) 

v = v(2) + v(3) + v(4) 

Case 4 : v(2) + v(3) - v(5) £. v(4) £ v(3) + v(5) - v(2) 

v = -|(v(2)+v(3)+v(4)+v(5)) = i(l-v(l)) 

Predictor f s optimal strategies for each of the four cases are given 
in Table 1. 
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4. ZERO-SUM GAMES WITH PENALTY FOR LONG PERIODS. 

We will now relax the restrictions on Emitter and let him 
choose freely the period of the sequence emitted. We will, however 
impose a penalty on Emitter for choosing too large periods. The 
game then proceeds as follows: 

(1) Emitter chooses a period t = 1,2,... and a periodic 
binary sequence with the period t. 

(2) Predictor is allowed to observe the first n digits of that 
sequence and tries to predict the next (n+l)st digit. 

(3) The payoff to Emitter is zero if the prediction is correct 
and is equal to a nonnegative constant c(t) if the predic- 
tion is wrong and the emitted sequence has period t. 

The sequence c(t), t = 1,2,... represents the penalty and we 
assume that for all t = 1,2,... 



To avoid the trivial case we assume that c(t) > 0 for at least 
one t . 

The resulting zero-sum game is then specified by the 
sequence (13) and an integer n = 1,2,... . We will denote it by 
G (c (t) ,n) . 

The space S of Emitter’s pure strategies s now consists 
of ordered pairs 



c(t) ^ c(t-KL) ^ 0 



(13) 



s = (t,s^) where t = 1,2,. 



and s^ 6 • 



Hence S is countably infinite. 
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The space $> of Predictor's pure strategies as same as 
same as before. 

The payoff function W(s,<|>), s £ S, <j) £ $ is for 
s = (t,s t ), s t = co given by 



W(s,<}>) 



c ( t ) if <t>(u| n ) T 4 



Cl) 



n+1’ 




if <J) (co | n ) 



u - . 
n+1 



(14) 



Introducing again Predictor T s behavioral strategies f we have 
now 



W(s,f) = 



c(t)f 0 (cu| n ) if 
cCtJfj^Cwl 11 ) if 



"n+1 = 1 ’ 



“n+1 ' °' 



Since the space $ is finite and by (13) the payoff function is 
bounded we have 



Proposition 4.1 : The game G(c(t) ,n) has a value and Predictor 

has an optimal strategy. 

This time, however, we can obtain explicit expressions. 



Proposition 4.2 : The value v of G(c(t),n) is given by 



v 



c (A (aO) ) c(A (al) ) 
c(A (aO) ) + c(A(al)) 



and the strategy 



* 

f defined for all 



a € A n by 



( 15 ) 
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f *, , c (A (aO) ) 

0 W c (A (aO) ) + c (A (al) ) 

is Predictor ! s optimal strategy. 



(16) 



Remark: Expressions — are to be interpreted as 0 in (15) and 

as any number x, 0 £ x ^ 1 in (16). 



Proof: Let f € F. 



sup W(s,r) 
s(ES 



sup 

t=l , 2 , . • 



{ c ( t ) max 
a£A 



n 



max[I t (al)f Q (a) ,I t (aO)f 1 (a) ] } 



= max max{f (a) sup c(t)I (al),f (a) sup c(t)I (aO)}. 
a€A n U t=l,2, . . . t 1 t=l,2, . . . 



From (13) for a = 0,1 

sup c(t)I (aa) = c(A(aa)). 
t = l , 2 , . . . 

Hence 



where 



v = min sup W(s,r) = max h(a) , 
f £F s€S a€A n 



h(a) = min max f (a)c(A(aa)) . (17) 

f (a) a=0,l a 

However, h(a) is easily recognized to be the value of the two- 
by-two matrix game with the matrix 



H (a) 



0 c (X (aO) ) ' 

c. (A (al) ) , 0 



so that (15) and (16) hold and the proposition is proved. 
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Let us now turn our attention to Emitter T s strategies. Since 
the space S is countable, the space X of Emitter ! s mixed strat- 
egies is simply the set of all probability distributions on S. 

It is, however, more convenient to define a mixed strategy x € X 
as the pair 



X = (C , { V* t >) > 



where £ = is a probability distribution on positive 

integers and y are distributions on finite sets ft . Clearly, 
if s = (tjS^) (E S and x € X then x(s) = £ y (s t ) . 

* * * n 

Proposition 4.3: Let a = (a., , . . . ,a ) € A be such that 

n 



h(a ) = max h(a) , 



a€A 



n 



(18) 



where h is defined by (17), let for a = 0,1 



t^ = X (a a) , 



and 



r ta (C l’’** 55 ta ) € ft t 



k k k 

Then Emitter's mixed strategy x = (E, >{y t ))> where 



( C(t,)[c(t 0 ) + c(t 1 )] 1 if t = t Q , 
5 t = ) c ^ t 0^ c ^ t o^ + 1 if t = t 1 . 



0 



otherwise, 
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and 



( 1 if s = r t = t , a = 0,1, 

l t t a 



U t (s t ) - 






arbitrary otherwise, 



is an optimal strategy. 



Proof ; Notice first that since by Lemma 2.1 



max{A (aO) ,A (al) } > 



+ 1 



the (n+l)st digits of the two sequences m = r and u) f = r 

c o c i 

must be different. Let now cf> £ $ . We have 



* 



W(x ,4>) 



l W(s,<f>)x (s) = £* c (tO) | <p (a*) - oj 1 1 
s€S c 0 n 



+ C^cCt^UCa") - u>^ +1 l 



* 

£ min £ c ( t „) 

a-0,1 a 



c(t Q )c(t 1 ) 

c(t Q ) + c(t 1 ) = V} 



and since this is true for any (p 6 $ , x is an optimal strategy. 



Remark : The results of this section remain true even if the time 

when Predictor is required to make a prediction is allowed to 
depend on the observed sequence in some predetermined fashion. 

For instance, we may ask Predictor to predict the next digit as 
soon as he observes a certain number k of ones or as soon as 
the number of observed digits reaches some n, whichever occurs 



first . 
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The space $ of Predictor’s pure strategies is then the 
set of all mappings <(> from a finite set B into {0,1}, where 
the set B is a finite collection of finite strings of zeros and 
ones with the property that for every binary sequence m £ ft 
there is a string b € B, which is the initial segment of oj. 

The set B is determined by the requirement imposed upon 
the prediction time. For instance, in the case mentioned above 
the set B will consist of all strings of at most n zeros and ones 
which contain at most k zeros. 

The only modification needed in order to extend the results 
of this section to this more general case is to replace the set 
A n by the set B. 

Example : Consider the game G(c(t),n) with n = 5 and 

( -^(6-t) if t = 1, . . . ,6, 
c(t) = < 

\ 0 if t > 6. 

To solve the game we first compute X (a^a) for each a_^. £ a\ 

a = 0,1. These are in the first two columns of Table 2. Then 

compute c(A(a^a)) for (19) and h(a^) from (17) - see Table 2. 

Hence the value v of the game G(c(t),n) is v = 1.5 

& 

and Predictor’s optimal strategy f as computed from (16) is in 
the last column of Table 2. Emitter’s optimal strategies are com- 
puted from Proposition 4.3. We obtain 
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.25 if t = 3, 



€ t = \ -75 if t = 5, 



0 otherwise, 



and 

y*({ (011)°°, (100)°°}) = 1, y* ({ (01101) , (10010)°°}) = 1 



and arbitrary otherwise. In other words. Emitter should produce 
the sequence 011011011... or 110110110... with probability 
1/4 and the sequence 0110101101... or 1101011010... with 
probability 3/4. 
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5. NONZERO-SUM GAMES. 

In the previous two sections we dealt exclusively with 
zero-sum games. That is, the payoff to Predictor was always 
assumed to be the negative amount of the payoff to Emitter. 

Let us now modify the game G(c(t),n) of the previous 
section by adding to its description the assumption: 

(4) The payoff to Predictor is one if his prediction is a correct 
one and zero otherwise. 

We obtain a new nonzero-sum game 

G’(c(t),n) = (S,$,W 15 W 2 ), 

where the strategy spaces S,$ and Emitter* s payoff function 
are same as in the zero-sum game (G(c(t),n). The Predictor's 
payoff function W^ is defined by 

( 1 if <t>(w| n ) = w n+1 > 

w 2 (s,4>) = 1 

\0 if <j>(a)| n ) f w n+1 > 

where oj is the sequence s^. 

Proposition 5.1 : The nonzero-sum game G*(c(t),n) has an equili- 

k k ^ 

brium point (x ,f ), where f is defined by (16) and x is 

* ± 

defined as in Proposition 4.3 with the exception that £ ~ 5. =1/2. 

L 0 1 

Furthermore, all equilibrium points are equivalent and interchangeable 
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yielding the minimax payoffs v and 1/2 to Emitter and Predictor 
respectively . 

k 

Proof : Let a be defined by (18) . Then for all f £ F by the 

* 

definition of x 



W 2 (x ,f) 



so that trivially 



sup W (x ,f) = W (x ,f ) = f. 
f€F 



Since f is a minimax strategy in G(c(t),n) 



* * k * I 

% f o (a } + V i(a } 



* k 1 



W^s.f*) 



£ V 



for any s 6 S. 

Now by the definition of 



* 

x and 



* 

f 



JL JU *.1* JL JU «jU JU 

W x (x ,f ) = c(t Q )5 t f 1 (a ) + c(t 1 )? t f q ( a ) 

= c(t 0 )c(t 1 > [c(t Q ) + c(t 1 >] 1 = h(a ) = v. 



Hence 

sup W (s,f ) = W (x ,f ) = v, 
s€S 

k k 

which together with (20) proves that (x ,f ) is an equilibrium 
point yielding minimax payoffs. 
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Finally it is easy to see that no equilibrium point can 
yield payoff greater than minimax for either of the players since 
this would decrease the other player f s payoff below his minimax 
value. Interchangebility is obvious. 

The last type of an Emission-Prediction game considered in 
this paper is the most general one and is obtained by further 
relaxing the restrictions on Predictor. 

(1) Emitter chooses a period t = 1,2,... and a periodic 
binary sequence with this period. 

(2) Predictor is allowed to observe as many digits of the 
sequence as he wishes and after he decides to cease 
observation he is required to predict the next digit. 

(3) The payoff to Emitter is zero if the prediction is 
correct and is equal to c^(t) ^ 0 if the prediction 
is wrong and the sequence has period t. 

(4) The payoff to Predictor is zero if his prediction is 
wrong and is equal to c^Cn) ^0 if it is correct 
and the length of the observed segment was n. 

The constants c^(t) and C 2^ n ^ represent penalties for long 
periods and lengthy observation. We assume that 

t = 1,2,... =* 1 = c 1 (l) ^ c 1 (t) ^ c 1 (t+l), lim c^t) = 0 

t->oo 



and 
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n = 0,1,... => 1 = c^CO) ^ c 2 (n) ^ c^Cn+l), lim c^(n) = 0. 



n-x» 



We will denote the resulting nonzero-sum game by G(c^(t), 



c 2 (n)). 



The space S of Emitter f s pure strategies in this game is 



same as before. Predictor's pure strategies i[> £ V are pairs 



where B is a finite set of finite strings of zeros and ones 
such that for every oo € ft there is some b (E B which is an 
initial segment of a) . (f> is a mapping from B into {0,1}. 

Clearly, the set ¥ of all Predictor T s pure strategies is now 
countably infinite. 

Let s = (t,s t >, = (B ,(j>) , a) = s , b £ B be the 

initial segment of u), and let n be the length of b. The 
payoffs are now given by: 



= ( 2,40 , 




(n) if <|>(b) = 0) n+1 , 



W 2 (s,ip) = 1 



n+r 



0 



if <f>(b) t w 
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Proposition 5.2 : The game (t) (n) ) has an equilibrium 

point . 



Proof : Let for j = 1,2 



p. (s,s f ) = sup|w.(s,ifi) - W . (s f ,ip) | and 
J,x ipev 3 3 



p. ,(*,*') = sup | W (s ,<p) = W . (s ,tj> ' ) | 

J,Z s€S J 3 

be Wald's (pseudo) metrics [8] on S and tji respectively. 
e > 0 and 



Let 



S e = { ( t ’ s t ) 6 S : c l (t) * | } > 



= t (B , 4>) G i p : c 2 (|b|) :> 



where |b| is the length of the longest string b G B. Since 



lim C- (t) = lim c n (n) = 0 the finite sets S and ^ are 

t~H» n -^°° 

e-nets in the (pseudo) metric spaces (S,p^) and (ij> ,p 22 ^ 
respectively and hence these spaces are totally bounded ("condi- 
tionally compact" in Wald's terminology). By [8], Theorem 2.1, 

so are then the spaces anc * ^ 5p 21^* ^et, f° r ^ = 1>2,. 

* * 



(x^^r) be an equilibrium point in the finite game ,W 1 

and a pair 



* * 

Then there is a subsequence (x^ ,y^.) , i = 1,2,... 

* * 1 1 

of mixed strategies x G X, y GY such that as i -> °°, j = 1,2, 
p . n (x. ,x ) 0 and p . „ (y, ,y ) -* 0. In particular this 



j,l' k. 
implies, as i 



j > 2 k ± 
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l w i<\ >y k ) - W (x*,y*)| 0, 

i i 



( 21 ) 



x6X 



sup W 1 (x,y ) - sup W 1 (x,y )| + 0, 



1/k. 



xgX 



( 22 ) 



ft 7s 

sup W 1 (s t y k ) - sup W (x,y ) | -* 0. 



X « X l/k, 

Finally, since (x ,y ) is an equilibrium point in G 

k. 4 k. . k. . 

XI X 

sup w 1 ( x »y k ) = W 1 (x k ,y k ) 
x ®l/k. 1 1 1 

whence letting i 00 in (21) through (23) we obtain 



(23) 



Similarly, 



sup W 1 (x,y*) = W (x*,y*), 
x£X 



ft ft ft 

sup W 2 (x ,y) = W 2 (x ,y ) , 
y€Y 

& * 

so that (x ,y ) is an equilibrium point in G (c^ (t) , c^ (n) ) and 
the proposition is proved. 



Remark: We suspect that much stronger statements, namely that 
all equilibrium points are equivalent and interchangeable, can 
be proved, but we have not been able to prove this. 
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TABLE 1. 
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TABLE 
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